ERCIM “ Alain Bensoussan ” Fellowship Scientific Report
نویسنده
چکیده
The Affinity Propagation (AP) clustering algorithm proposed by Frey and Dueck (2007) provides an understandable, nearly optimal summary of a data set, through message passing among all pairs of data items. However, it suffers two major shortcomings: i) the number of clusters is vague with the user-defined parameter called self-confidence, and ii) the quadratic computational complexity. When aiming at a given number of clusters due to prior knowledge (e.g., number of classes in a supervised context), AP has to be launched many times until an appropriate setting of self-confidence parameter is found by a bisection method. The re-launched AP increases the computational cost by 1 order of magnitude. In this paper, we propose an algorithm, called K-AP, to exploit the immediate results of K clusters by introducing a constraint in the process of message passing. Through theoretical analysis and experimental validation, K-AP was shown to be able to directly generate K clusters as userdefined, with a negligible increase of computational cost compared to AP. In the meanwhile, KAP preserves the clustering quality as AP in terms of the distortion (sum of the squared distance between each data item and its assigned cluster center). It is more effective than k-mediods w.r.t. the distortion minimization and higher clustering purity. Adaptively detecting changes in autonomic grid computing. Xiangliang Zhang, Cecile Germain and Michele Sebag. In: Proceedings of 11th ACM/IEEE International Conference on Grid Computing (Grid 2010), workshop on Autonomic Computational Science. Abstract: Detecting the changes is the common issue in many application fields due to the non-stationary distribution of the applicative data, e.g., sensor network signals, web logs and grid running logs. Toward Autonomic Grid Computing, adaptively detecting the changes in a grid system can help to alarm the anomalies, clean the noises, and report the new patterns. In this paper, we proposed an approach of self-adaptive change detection based on the PageHinkley statistic test. It handles the non-stationary distribution without the assumption of data distribution and the empirical setting of parameters. We validate the approach on the EGEE streaming jobs, and report its better performance on achieving higher accuracy comparing to the other change detection methods. Meanwhile this change detection process could help to discover the device fault which was not claimed in the system logs. Detecting the changes is the common issue in many application fields due to the non-stationary distribution of the applicative data, e.g., sensor network signals, web logs and grid running logs. Toward Autonomic Grid Computing, adaptively detecting the changes in a grid system can help to alarm the anomalies, clean the noises, and report the new patterns. In this paper, we proposed an approach of self-adaptive change detection based on the PageHinkley statistic test. It handles the non-stationary distribution without the assumption of data distribution and the empirical setting of parameters. We validate the approach on the EGEE streaming jobs, and report its better performance on achieving higher accuracy comparing to the other change detection methods. Meanwhile this change detection process could help to discover the device fault which was not claimed in the system logs. Self-adaptive Change Detection in Streaming Data with Non-stationary Distribution. Xiangliang Zhang and Wei Wang. Accepted by International Conference on Advanced Data Mining and Applications (ADMA2010). Abstract: Non-stationary distribution, in which the data distribution evolves over time, is a common issue in many application fields, e.g., intrusion detection and grid computing. Detecting the changes in massive streaming data with non-stationary distribution helps to alarm the anomalies, clean the noises, and report the new patterns. In this paper, we employ a novel approach for detecting changes in streaming data with the purpose of improving the quality of modeling the data streams. The self-adaptability of the novel approach enhances the effectiveness of modeling data streams by timely catching the changes of distributions. We Non-stationary distribution, in which the data distribution evolves over time, is a common issue in many application fields, e.g., intrusion detection and grid computing. Detecting the changes in massive streaming data with non-stationary distribution helps to alarm the anomalies, clean the noises, and report the new patterns. In this paper, we employ a novel approach for detecting changes in streaming data with the purpose of improving the quality of modeling the data streams. The self-adaptability of the novel approach enhances the effectiveness of modeling data streams by timely catching the changes of distributions. We Page 3 sur 3 validated the approach on an online clustering framework with a benchmark KDDcup 1999 intrusion detection data set as well as a real-world grid data set. The validation results demonstrate its better performance on achieving higher accuracy and lower percentage of outliers comparing to the other change detection approaches. III -Attended Seminars, Workshops, and Conferences Please identify the name(s), date(s) and place(s) of the events in which you participated during your fellowship period.
منابع مشابه
Ercim " Alain Bensoussan " Fellowship Programme Scientific Report
During my fellowship, I collaborated with Prof. Ole Morten Aamo and Ass. Prof. Øyvind Stavdahl. One of the objective of the project was the development and investigation of methods for a computer based video sequence analysis in order to detect a certain kind of General Movements (GM) observed in the motion performed by infants with the aim to determine the risk to have Cerebral Palsy. Cerebral...
متن کاملPredicate Gates for Spatial Logic
The predicate logic gates are proposed for spatially modulated signals. Two voltage impulses are transmitted along the coupled wires, and they are considered as a predicate expression. The developed gates perform the logical operations AND, NOT and OR with these expressions. They are used for a predicate logic microprocessor. Key-Words: Artificial spatial intelligence, predicate logic, topologi...
متن کاملSerious work on playful experiences: a preliminary set of challenges
1 This work was carried out during the tenure of an ERCIM “Alain Bensoussan” Fellowship Programme ABSTRACT As work, leisure and social activities blend together, and amateur and professional practices becomes harder to distinguish, we need to explore the role of technology that works to support people in this rich range of everyday experiences. Incorporating fun, playful elements in the workpla...
متن کاملModel complexity reduction of chemical reaction networks using mixed-integer quadratic programming
The model complexity reduction problem of large chemical reaction networks under isobaric and isothermal conditions is considered. With a given detailed kinetic mechanism and measured data of the key species over a finite time horizon, the complexity reduction is formulated in the form of a mixed-integer quadratic optimization problem where the objective function is derived from the parametric ...
متن کاملDegenerate Dirichlet Problems Related to the Ergodic Property of an Elasto-Plastic Oscillator Excited by a Filtered White Noise
متن کامل
Coalitions of Arguments: An Approach with Constraint Programming
The aggregation of generic items into coalitions leads to the creation of sets of homogenous entities. In this paper we accomplish this for an input set of arguments, and the result is a partition according to distinct lines of thought, i.e., groups of “coherent” ideas. We extend Dung’s Argumentation Framework (AF) in order to deal with coalitions of arguments. The initial set of arguments is p...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010